Reinforcement Learning: An Introduction: Learning Through Interaction: The RL Paradigm

Reinforcement Learning (RL) is more than just an algorithm; it is a computational approach to learning from interaction. Unlike supervised learning, where a teacher provides the "correct" answer, RL is centered on a goal-directed agent that must discover which actions yield the most reward through trial and error.

The Sensorimotor Connection

Learning is grounded in a direct sensorimotor connection to the environment. Imagine a robotic arm learning to pick up a fragile object:

Sensation: The robot perceives the object's position through cameras (state).
Action: The robot moves its joints (influence).
Goal: To lift the object without breaking it (the success metric).

Every choice changes the state of the world the agent is sensing. The robot strengthens the connections that lead to a successful grasp without needing a human to program every millimeter of movement.